Rank | Count | Beginning |
---|---|---|
7 | 1712 | The |
4 | 573 | In |
26 | 359 | He |
68 | 218 | This |
39 | 216 | " |
56 | 194 | A |
23 | 146 | She |
50 | 128 | On |
126 | 128 | They |
29 | 127 | There |
64 | 100 | His |
45 | 88 | However, |
146 | 78 | These |
12 | 74 | As |
40 | 71 | When |
74 | 67 | For |
209 | 50 | Some |
363 | 44 | By |
36 | 41 | While |
139 | 41 | Unreferenced |
654 | 35 | One |
156 | 34 | An |
129 | 32 | Since |
154 | 32 | Her |
210 | 32 | Many |
217 | 30 | During |
299 | 30 | Their |
44 | 25 | "The |
1430 | 25 | Career |
254 | 23 | Summary |
The next four subsections show the most frequent sentence beginnings consisting of N words, for N=1, 2, 3, 4. This subsection starts with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
For N=1 in particular, a small corpus is enough to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
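The same counting can be sketched outside the database for any N. The following Python example is a minimal sketch, not the code used to produce the table. It assumes the sentences are available as a plain list of strings; the function name `sentence_beginnings` and the sample sentences are hypothetical. Splitting on a single space and rejoining the first N parts mirrors what `substring_index(sentence, ' ', N)` returns, so attached punctuation (as in `However,`) stays part of the beginning.

```python
from collections import Counter


def sentence_beginnings(sentences, n=1, limit=50):
    """Return the `limit` most frequent beginnings made of the first n words.

    Words are delimited by a single space, as in the SQL query, so
    punctuation attached to a word is kept (e.g. 'However,').
    """
    counts = Counter(" ".join(s.split(" ")[:n]) for s in sentences)
    return counts.most_common(limit)


# Hypothetical sample data, for illustration only.
sample = [
    "The cat sat.",
    "The dog ran.",
    "In the morning it rained.",
    "However, it stopped.",
]

top = sentence_beginnings(sample, n=1)
for beginning, count in top:
    print(count, beginning)
```

Calling `sentence_beginnings(sample, n=2)` gives the two-word beginnings in the same way, which corresponds to replacing the `1` in both `substring_index` calls of the query with `2`.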